The Implementation and Optimization of Irregular Application Task Models Based on the Cell BE Processor

نویسندگان

  • Ji-Lin Zhang
  • Yi Chen
  • Jue Wang
  • En-Yi Liu
  • Yu-Yu Yin
  • Yong-Jian Ren
  • Hong Li
  • Cai-Hong Li
چکیده

The task parallelization is proposed in the OpenMP3.0 specification, which aims to resolve the irregular parallel computing problems. This paper presents the novel irregular application task model on Cell BE processors, which could reduce the difficulty of irregular applied parallel programming. In this Model, the two kinds of optimization techniques for maximum task numbers and maximum recursive layer are realized to avoid producing a large amount of fine-grained tasks and improve effectively the program performance of parallel execution. The experimental results show that the speedup of the typical irregular application is increased to 5.3 in a single Cell processor with six SPEs. Keyword: Heterogeneous Multi-Core, Irregular, Task Parallel, Breadth-First

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Toolbox for Generating Realistic Biological Cell Geometries for Electromagnetic Microdosimetry

Researchers in bioelectromagnetics often require realistic tissue, cellular and sub-cellular geometry models for their simulations. However, biological shapes are often extremely irregular, while conventional geometrical modeling tools on the market cannot meet the demand for fast and efficient construction of irregular geometries. We have designed a free, user-friendly tool in MATLAB that comb...

متن کامل

Validation and application of empirical shear wave velocity models based on standard penetration test

Shear wave velocity is a basic engineering tool required to define dynamic properties of soils. In many instances it may be preferable to determine Vs indirectly by common in-situ tests, such as the Standard Penetration Test. Many empirical correlations based on the Standard Penetration Test are broadly classified as regression techniques. However, no rigorous procedure has been published for c...

متن کامل

Design and Implementation of Field Programmable Gate Array Based Baseband Processor for Passive Radio Frequency Identification Tag (TECHNICAL NOTE)

In this paper, an Ultra High Frequency (UHF) base band processor for a passive tag is presented. It proposes a Radio Frequency Identification (RFID) tag digital base band architecture which is compatible with the EPC C C2/ISO18000-6B protocol. Several design approaches such as clock gating technique, clock strobe design and clock management are used. In order to reduce the area Decimal Matrix C...

متن کامل

Fast Cellular Automata Implementation on Graphic Processor Unit (GPU) for Salt and Pepper Noise Removal

Noise removal operation is commonly applied as pre-processing step before subsequent image processing tasks due to the occurrence of noise during acquisition or transmission process. A common problem in imaging systems by using CMOS or CCD sensors is appearance of  the salt and pepper noise. This paper presents Cellular Automata (CA) framework for noise removal of distorted image by the salt an...

متن کامل

Application of Task Complexity Along +/- single Task Dimension and its Effect on Fluency in Writing Performance of Iranian EFL Learners

In the present study, two different models of task complexity; namely, limited attentional capacity model and cognition hypothesis were examined. To this end, the manipulation of cognitive task complexity along +/- single task dimension on Iranian EFL learners’ production in terms of fluency was explored. Based on the results of the writing test of TOFEL (2004), 48 learners were selected as the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012